Detecting rumours with latency guarantees using massive streaming data

نویسندگان

چکیده

Today’s social networks continuously generate massive streams of data, which provide a valuable starting point for the detection rumours as soon they start to propagate. However, rumour faces tight latency bounds, cannot be met by contemporary algorithms, given sheer volume high-velocity streaming data emitted networks. Hence, in this paper, we argue best-effort that detects most quickly rather than all with high delay. To end, combine techniques efficient, graph-based matching patterns effective load shedding discards some input while minimising loss accuracy. Experiments large-scale real-world datasets illustrate robustness our approach terms runtime performance and accuracy under diverse conditions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overload Management in Data Stream Processing Systems with Latency Guarantees

Stream processing systems are becoming increasingly important to analyse real-time data generated by modern applications such as online social networks. Their main characteristic is to produce a continuous stream of fresh results as new data are being generated at real-time. Resource provisioning of stream processing systems is difficult due to time-varying workload data that induce unknown res...

متن کامل

Approximated Bayesian Inference for Massive Streaming Data

Extracting meaningful information out of massive streaming data is a significant challenge due to the high dimensionality of the inference problem and limits on available computational power and memory. While Bayesian models often convey significant inferential advantages, standard computational algorithms relying on Markov chain Monte Carlo are infeasible to apply. This motivates online variat...

متن کامل

Towards Detecting Rumours in Social Media

The spread of false rumours during emergencies can jeopardise the well-being of citizens as they are monitoring the stream of news from social media to stay abreast of the latest updates. In this paper, we describe the methodology we have developed within the PHEME project for the collection and sampling of conversational threads, as well as the tool we have developed to facilitate the annotati...

متن کامل

Computational Graph Analytics for Massive Streaming Data

Handling the constant stream of data from health care, security, business, and social network applications requires new algorithms and data structures. We present a new approach for parallel massive analysis of streaming, temporal, graph-structured data. For this purpose we examine data structure and algorithm trade-offs that extract the parallelism necessary for high-performance updating analy...

متن کامل

Streaming Algorithms for Distributed, Massive Data Sets

Massive data sets are increasingly important in a wide range of applications, including observational sciences, product marketing, and monitoring and operations of large systems. In network operations, raw data typically arrive in streams, and decisions must be made by algorithms that make one pass over each stream, throw much of the raw data away, and produce \synopses" or \sketches" for furth...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Vldb Journal

سال: 2022

ISSN: ['0949-877X', '1066-8888']

DOI: https://doi.org/10.1007/s00778-022-00750-4